课程视频链接: https://www.youtube.com/playlist?list=PLAwxTw4SYaPnFKojVQrmyOGFCqHTxfdv2 [TOC] 1. Unit 1 1.1 typical CUDA Program 1.2 parallel communication patterns 1.3 GPU allocate blocks to SMs 1.3 GPU memory hierarchy 1.4 high level strategies of optimizing performance